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Abstract. For any finite set of integers X, define its sumset X + X to be {x + y : x,y € X}. 
In a recent paper, Martin and O'Bryant investigated the distribution of \A + A\ given the uniform 
distribution on subsets A C {0, 1, . . . , n — 1}. They also conjectured the existence of a limiting 
distribution for \ A + A\ and showed that the expectation of \A+ A\ is In - 11 + 0((3/4)™ /2 ). Zhao 
proved that the limits m(k) :— Iim n _ ) .oo P (2n — 1 — \A + A\ = k) exist, and that J2k>o m (k) = 1. 

We continue this program and give sharp bounds on m(k) for small k and also as k — > oo. Sur- 
prisingly, the distribution appears to be bimodal; sumsets have an unexpected bias against missing 
exactly 7 sums. We also derive an explicit formula for the variance of \A + A\ in terms of Fi- 
bonacci numbers. New difficulties arise in the form of weak dependence between events of the form 
{x e A + A}, {y E A + A}. We surmount these obstructions by translating the problem to graph 
theory. This approach also yields exponential upper and lower bounds (independent of n) for A + A 
missing exactly k sums, and the approximate probability for A + A missing a consecutive block of 
length k. 

Note: the computations concerning the existence of the divot are for 
n < 32, and are thus only suggestive and not conclusive proof of its 
existence. We are currently running larger computations for n < 45, 
which should conclusively prove the divot's existence. We expect to 
update this paper with those calculations by the end of June. 
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1. Introduction 

The central object of additive number theory UNUTVH is the sumset A + A of a set A of integers: 

A + A := {a x + a 2 : a x , a 2 G A}. (1.1) 

Typically, the theory is concerned with extremal behavior, such as the structure of finite A when 
| A + A\/\A\ is nearly minimal (Freiman's Theorem), or the possible densities of A when \A + 
A\/( l f) is maximized (Sidon Sets). See DHE1 for surveys and BBUI for examples. 

Here we focus on typical behavior: for a randomly chosen set A of integers, what is the expected 
value and variance of \A + A\l Though this problem was motivated by comparing \A + A\ to 
\A — A\, in this paper we concentrate on \A + A\; see [ILMZJ for recent results comparing the size 
of (generalized) sum and difference setsg. The answer to our question of course depends on how 
A is chosen, and we focus our attention on A taken uniformly from the 2 n subsets of [0, n — 1] (see 
IHM JMSl for some results on \A + A\ when we no longer choose subsets of [0, n — 1] uniformly); 
we denote intervals of integers as [a, b] := {x G Z : a < x < b}. In SI .31 and §17.21 we discuss 
some variations on the manner of choosing a random set of natural numbers. 

Other authors have considered aspects of typical behavior of sumsets. When Erdos and Renyi 
[ERJ first applied the probabilistic method to number theory, they observed that with probability 
1, a uniformly random subset C of N has C + C = N \ F for some finite set F, but made no effort 
to explore F further. The present work concerns itself with properties of the set 

F n := [0,2n-2]\(A + A), (1.2) 

with A as above. We prove the existence of 

limE[|F„n (1.3) 

for every r > 1, give upper and lower bounds on 

F(\F n \ = k) (1.4) 
for small k, large n, and also as k — » oo, and also bound 

P({ai,02,...,a fc }CF n ). (1.5) 

Our work is usually quantitatively effective, and we report numerical estimates throughout. 

The key obstacle to finding the limiting distribution of \F n \ is the dependence between different 
elements occurring or not occurring in A + A. For example, 3 G" A + A and 7 g" A + A are 
dependent events since both are affected by whether or not 2 is in A. We develop a graph theoretic 
framework which makes it much easier to visualize the dependence between such events and to 
develop bounds that incorporate the dependence. It is possible to avoid this framework, but doing 
so makes both the notation and the underlying issues less clear. 

Graph theory has been used in additive number theory before. For example, Pliinnecke (see the 
description in 0) uses graph theory to estimate the size of fc-fold sumsets in terms of \A\ and 
| A + A\; Alon and Erdos [AEJ use hypergraphs to study Sidon sets. Our use of graph theory seems 
to be different from these as we investigate the size of A + A for typical A, without reference to 
the size of A itself. 
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The next subsection of this introduction sets up our notation and states our main results. The last 
two subsections provide more motivation and indicate the nature of our proofs and computations. 
In §[2l we develop a graph theoretic framework for handling the dependencies between events like 
{ai G F} and {a 2 G F}. In §[3l we find an explicit formula for the limit of the variance of \F\ 
and prove Theorem 11.41 stated below. In §21 we prove the exponential bounds for Theorem 11.11 
In §[51 we find the probability of missing certain configurations and prove Theorem 11.51 while in 
§[6] we discuss consecutive missing elements and prove Theorem 11.61 We return to the problem 
of explicit bounds on P (\F n \ = k) for small k and the existence of a limiting distribution for \F n \ 
in §0 Finally in §(8l we discuss some problems for future research and how the graph theoretic 
framework may be applied to such problems. 

1.1. Terminology and Theorems. The main characteristic of A+ A is that it is almost full. Martin 
and O' Bryant HMOI proved that 

E [\A + A\] = 2n - 1 - 10 + O ((3/4) n / 2 ) . (1.6) 

Since typical sumsets are almost full, it is more natural to investigate the number of missing sums, 
which is why we write the above as 2n — 1 minus 10. As noted in HMOL sumsets are almost full 
because middle elements have many representations as a sum of two elements of [0, n — 1]; each 
i G [0, 2n — 2] has roughly n/4 — \n — representations. 
We set 

M [0l „_i] := \[0,2n-2]\(A + A)\ = 2n-l-\A + A\, (1.7) 
m n (k) := P (M [0)n _i] = k) , (1.8) 
m(k) := lim m n (k). (1.9) 

n— >oo 

A special case of Zhao's theorem [ZJ is that m(k) is well-defined, strictly positive, and that 
X]fclo m (^) = 1' so m at we can think of m(k) as defining a distribution on N. Thus, we can 
speak of "the probability that a large finite set A has a sumset that misses exactly 17 elements" 
and mean something sensible. Zhao's work is numerically impractical and did not give reasonable 
upper bounds onm(fc);we do that in §[71 where we also reprove Zhao's results in this easier setting. 
See Figure[T]for the experimental estimates and rigorous bounds on m(k) for < k < 32. 
The result (11.61) above implies that 

lim E [M[o,„-i]l = 10. 

n^oo L J 

Equivalently, in light of Zhao's work, J^fcLo km(k) = 10. To this, we add the following results. 
Let <fi := (1 + v / 5)/2 be the golden ratio. 

Theorem 1.1. Let n > 5k. Then 

2- k/2 < m n {k) < (0/2) fc , (1.10) 
where the implied constants are independent of k and n. 

Note that 2 -1 / 2 w 0.707 and 0/2 « 0.809, so that bounds provided by Theorem [Tj] are reason- 
ably close. We suspect, based on numerical data, that the following conjecture represents the truth 
of the matter, and perhaps even A = y/<f) — \. 

Conjecture 1.2. There exists A such that for any e > 0, 

(A - e) fc < e m(k) < e (A + e) k . (1.11) 
3 
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Figure 1. Experimental values of m(k), with vertical bars depicting the values 
allowed by our rigorous bounds. See §17 .21 for details. 



From numerical data, A ~ 0.78. 

The exponential bounds of Theorem 11.11 already imply that the rth moment remains bounded 
for any r > 1. 

Corollary 1.3. The limit of the r th moment of M[ 0jn _i], 

lim E [Mjp X ]] , (1.12) 

exists and is finite. 
Theorem 1.4. The limit 

lim Var(M [0 , ft _i]) (1.13) 

exists and is about 35.9658 (as these are the first digits of its decimal expansion). This limit can be 
written as the following convergent series with exponential decay: 

lim Var(Mr 0n _n) = 4VP(i and j & A + A) - 40. (1.14) 

i<j 

1 .2. Variance and Decay Rates of Missing Sums. The bounds in Theorem ITTTI are due to formu- 
las for probabilities of events such as 

P(ai, a 2 , anda m g A + A), (1.15) 

by which we mean the probability that all of oi, a 2 , . . . , a m are in the complement of A + A. This 
represents the probability that a particular configuration is not in A + A. As long as n > a m , 
there is no dependence on n since this probability just depends on [0, a m ) fl A. We therefore 
can assume that A C [0, a m ]. Formulas for such probabilities are also important for finding the 
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moments of M[ 0in _ii. For example, to find the expectation of \A + A\, HMO! find an exact formula 
for F(k $l A + A), which is approximately 

F{k£A + A) = 6((3/4) fc/2 ), (1.16) 

where we say g{n) = 9(/(n)) if there exist constants C\, C 2 such that for all n 

C x f(n) < g(n) < C 2 f(n). (1.17) 

Similarly, to find the variance, we can study P(i and j ; A + A) as seen from the series expansion 
in (11.1 4b . In Proposition |3.5[ we find an exact formula for this probability, and in Corollary 13 .61 we 
show that for fixed m we have the following approximation: 

F(k and k + m ^ A + A) = 6((0/2) fc ), (1.18) 

where <p is the golden ratio. The implied constants in (11.181) depend significantly on m, and in 
Corollary 13 .61 we also find these constants. 

Note that both (11.161 ) and (11.181 ) are exponential in A;. In fact, we prove that in general such 
probabilities are approximately exponential in k. 

Theorem 1.5. For any fixed CL\ , . . . , CL m , there exists \ ai ,...,a m such that 

F(k + ai, fc + a 2 , . . . , and k + a m £ A + A) = 6(A^ . a J, (1.19) 
where the implied constants depend on a±, . . . ,a m but not k. 

The fact that F(k + ai, k + a 2 , . . . , and k + a m ^ A + A) is approximately exponential supports 
Conjecture 1 1 .21 that the distribution of missing sums is approximately exponential. 

For the particular configuration a\ = l,a 2 = 2, . . . , a m = m, the case of consecutive missing 
elements, we can approximate A aiv .. a , n well as seen in the following theorem. 

Theorem 1.6. For any k, m 

/,x(fe+m)/2 /-|x(fe+m)/2 

(-J < F(k + l,k + 2,..., andk + m £ A + A) < f -J (1 + e m ) k , (1.20) 

with e m — > as m — > oo. To be more precise, the exact form of upper bound is (i / /2)( fc+m )/ 2 2 fc/ ' m . 

As we will see in the proof of Theorem II. 1[ the lower bound (l/2)( fc+m )/ 2 is essentially the 
probability of missing the first k + m elements in A + A. By Theorem II .61 we have that for large 
m, f(k + 1, k + 2, . . . , and k + m ^ A + A) is also approximately (l/2)( fc+m )/ 2 . This means that 
for large m, essentially the only way to miss m consecutive elements in A + A starting at k + 1 is 
through the trivial way - namely missing all of the first k + m elements of A + A. 

1.3. Other types of random sets and the divot. Figure Q] shows a surprising phenomenon: ex- 
perimentally, 

m(7) < m(6) < m(8). (1.21) 

That is, a random subset of [0, 10 10 ] is more likely to have a sumset missing 6 (or 8) elements than 
one missing 7. The distribution of M n appears to be bimodal for large n. While the rigorous bounds 
shown in Figure [T] do not prove that this tiny "divot" at 7 is real, conservative 99.9% confidence 
intervals for m(7), m(6), m(8) are 

0.07133 < m(7) < 0.07153, 0.07172 < m(6) < 0.07191, 0.07240 < m(8) < 0.07261. 
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As surprising as the reality of the divot will be, in light of these confidence intervals it would be 
even more surprising were it to not be. We have a large computation currently underway (details 
in §0 that we expect to sharpen our rigorous bounds enough to prove m(7) < m(6) < m(8). 
Closer inspection of Figured] also reveals an apparent parity effect: 

m{2k) + m{2k + 2) > 2m(2fc + l). (1.22) 

Here are two plausible explanations for this. The first is that M[o, n -i] is essentially the sum of two 
iidrvs: the number of missing sums in [0, n — 1] and in [n, 2n — 2]. For any two iidrvs Xi, X 2 
taking integer values, P {X\ + X 2 even) > P (Xi + X 2 odd), as the calculation comes down to 
x 2 + y 2 > 2xy. Another parity effect is observed on the ends: as soon as ^ A, then both and 1 
are not in A + A. Thus, on the ends, A + A always misses an even number of sums. 

To compensate for these observations, it is necessary to consider the connections between differ- 
ent ways of selecting a random set. We consider uniformly selecting subsets of [0, n — 1], subsets 
of [0, n] with diameter n, subsets of N, and subsets of N that contain 0. We lay out our notation as 
follows: 



set 


setting 


condition 


missing sums 


P(missing k sums) 


A 


[0,n-l] 





M [0 , n _i] :=2n-l-\A + A\ 


m n (k) 


B 


[0,n] 


{0,n} C B 


M[ , n ]|{o,n} := 2n + 1 - \B + B\ 


w n {k) 


C 


N 





M N := |N\(C + C)| 


y(k) 


D 


N 


€ D 


M N , {0} := \N\(D + D)\ 


z(k) 



Additionally, we set m{k) := lim^oo m n (k) and w{k) := lim^oo w n (k). 
Our first parity-effect observation essentially boils down to 

k 

m n (k) -> ^2y{i)y(k-i); (1.23) 

a rigorous exposition of this can be found in [IJ and is sketched in §17.21 The second observation 
and Bayes' Theorem leads us to 

00 |fc/2j 

y (k) = ^P(minC = z)P(|[2i,oo)\(C + C)| = k-2i) = ^ 2- {i+1) z{k - 2i). (1.24) 

i=0 i=0 

Similarly to (11.231) , one can prove that 

k 

w n{k) -> ^2,z{i)z{k-i). (1.25) 

i=0 

Thus, all four distributions can be understood in terms of z(k). Experiments and our bounds (see 
Figure [2] for small values of k) indicate that M N |{ } has an approximately geometric distribution, 
and exhibits no obvious parity effect. Computationally, we focus on bounding z and then allow 
this to determine bounds on m, w and y. 

We bound z by conditioning on / := D fl [0, 32], and loop over all 2 32 possible values of / (a pri- 
ori, G I). For each I C [0, 32], we explicitly know (D + D) fl [0, 32], we have much information 
concerning (D + D)(l [33, 63], and theoretically (D + D) fl [64, 00) is [64, 00) with high probabil- 
ity. This allows us to give reasonable upper and lower bounds on P (M N |{ } = k \ D fl [0, 32] = i) 
for each /, and it is then routine to combine these to bound z(k). The computation is 
already quite massive, and we will replace "32" with "45" when the 
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Figure 2. Experimental values of m(k), w(k), y(k), z(k), with vertical bars de- 
picting the values allowed by our rigorous bounds. See §|7]for details. 



computations finish. We will update the paper when this happens, 
probably sometime in June. 

If we suppose that Mm{o} is exactly geometric with parameter A (i.e., set z(k) = (1 — A)A fc ) 
and define y(k) and m(k) using (11.231 ) and (11.241) . we find that the distribution of M N |{ } would 
be bimodal with a divot at k = 7 only for the narrow parameter range 0.756 < A < 0.771. 
The best-squares for A is 0.765. If we suppose that Mn|{ } has a Poisson distribution, i.e., 
z(k) = X k e~ x /k\, we find that there are no A whatsoever that give a bimodal distribution with 
divot at k = 7. 

This implies that the divot's existence relies not only on the above observations but also on the 
specific values of z k for small values. We note that z A in particular is larger than the geometric 
model predicts; more than half of the least-squares error is from z±. The rigorous bounds we give 
also show this bias towards 4, though we currently have no understanding as to why this is the 
situation. 

Theorem 1.7. The limits defining m(k) and w(k) are well-defined, positive, and Yl C k=o m (^) = 
J2'k'=o w (k) = 1- Rigorous bounds on m(k), w(k), y(k) and z{k) for < k < 32 are given in Ap- 
pendix^ These numbers will be replaced with the results of computations 
up to 45 by late June. 



This was fitting the values, but it would be more appropriate to fit the logarithms. We do not pursue this as our 
goal here is only to get a general sense of the behavior. 
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2. Graph-Theoretic Framework 



We first develop a graph-theoretic framework to study dependent random variables and calculate 
probabilities like P(a 1; . . . , and a m £ A + A). Note that for odd i 

{i&A + A} = {(O^AorigA) and ■ ■ ■ and ((i - l)/2 £ A or (i + l)/2 £ A)} , (2.1) 

and for even z 

{z ^ A + A} = {(0 £ A or i £ A) and ■ ■ ■ and (i/2 - 1 £ A or i/2 + 1 A) and z/2 £ A}. 

(2.2) 

Therefore for distinct i the events {? ^ A + A} are dependent as both depend on conditions on A 
like {0 ^ A}. 

For example, the conditions on A necessary for {3 and 7 ^ A + A} are 

z = 3 : or 3 ^ A j = 7: or 7 ^ A 

and 1 or 2 ^ A and 1 or 6 ^ A 

and 2 or 5 £ A ( ^ 

and 3 or 4 £ A. 

Since the two lists have integers in common, there is dependence between the events {3 ^ A + A} 
and {7 £ A + A}. 

We construct a graph to represent the dependencies between the random variables. We call this 
graph the condition graph for the probability. We construct the condition graph for P(ai, . . . , and 
a m £ A + A), where a± < • • • < a m , in the following way: 

(1) For every integer in [0, a m ], add a vertex labeled with that integer. 

(2) Add an edge between two vertices labeled with i and jifi + j — a k for some 1 < k < m. 
See Figure [3] for the condition graph for P(3 and 7 ^ A + A). 




Figure 3. Condition Graph for P(3 and 7 g A + A). 



By construction, we have a one-to-one correspondence between edges and conditions and ver- 
tices and integers in [0, a m ] . For example, the edge between vertices labeled with 1 and 6 represents 
the condition that 1 or 6 ^ A, which is one of the conditions necessary for 7 ^ A + A in (12.31) . For 
each condition, we need to pick at least one element to exclude from A. Therefore in the condition 
graph, for each edge we need to pick at least one of its vertices. That is, we need to pick a vertex 
cover (recall a vertex cover of a graph is a set of vertices such that each edge is incident to at least 
one vertex in the set). Using this method, we get the following lemma. 

Lemma 2.1. P(ai, . . . , and a m (jL A + A) equals the probability that we chose a vertex cover for 
the condition graph. 
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Note that when we pick vertices in the condition graph for our vertex cover, we are picking to 
exclude those vertices from A. For example, note that the vertices 7, 0, 4 and 6, 2 form a vertex 
cover for the condition graph of P(3 and 7 ^ A + A) in Figure [3] Then if 7, 0, 4, 6, 2 ^ A, then 
3 and 7 ^ A + A since all conditions in (12.31 ) are met. 

Finally, note that when we calculate the probability of choosing a vertex cover for the condition 
graph, we no longer need to consider a labeled graph. This is because vertices represent elements of 
A, and since each element of A is equally likely to be chosen (as A is chosen uniformly randomly), 
we do not need to differentiate between different elements. 



3. Variance of Missing Sums 

We now use the graph-theoretic framework from the previous section to prove Theorem 1 1.41 and 
find the variance. 

We first note that the result of HMOII in (fL6l) is really that 

E[M n (A)} = Yl Hi$A + A) = 10 + O((3/4) n / 2 ). (3.1) 

0<i<2n-2 

Since 

Var (M n (A)) = E [M n (A) 2 } - (E [M n (A)]) 2 (3.2) 

and we know E [M n (A)] from (13.11 ), to find the variance we just need to determine E [M n (A) 2 ], 
which equals the following: 



E [M n (A) 2 ] = ^ |{ missin g sumsm ^ + ^}P 

= [0,n-l] 

E E i 

= [0,n-l] 0<i,j<2n-2 
i,j£A+A 



2 r > 
1 

2 n 



0<i,j<2n-2 AC [0,n-l] 
i,j£A+A 



Y F ( A [0, n - 1] : i and j £ A + A) 

0<M<2n-2 

2 Y p (* and j ^ A + A) + ^ P(z £ A + A). (3.3) 



0<i<j<2n-2 0<i<2n-2 



Combining (13721) . (I37TI) . and (1331) . we get 

Var(M n (A)) = 2 ^ P(i and j £ A + A) - 90 + 0((3/4)™ /2 ). (3.4) 

0<«<j<2n-2 

We first simplify the sum over z, j. Note that if i, j < n, then 

P(i and j g A + A) = P(2n -2-i and 2n - 2 - j g A + A), (3.5) 

and so 

P(i and j g A + A) = ^ p (* and j &A + A). (3.6) 

0<i<j<n n<i<j<2n-2 



Also, note that if i < n/2 and j > 3n/2, then {i ^ A + A} and {j ^ A + A} are independent. 
This is because {% ^ A + A} depends only on [0, i\ D A and {j A + A} depends only on 
\j — n + 1, n — 1] fl A and if i < n/2 and j > 3n/2, these sets are disjoint. Therefore for such i, j, 
we have 

P(i and j £ A + A) = P(i g A + A)¥(j & A + A). (3.7) 
Finally note that if n/2 < i < n or n < j < 3n/2, then 

P(i and j # A + A) = 0((3/4) n/4 ) (3.8) 

by (fTT6l) . Therefore 

^2 p (« and j &A + A) 

i<n, n<j 

V P(i and j £ A + A) + P(i and j g A + A) 

i<n/2 and 3n/2<j n/2<i<n or n<j<3n/2 

^(i and j £ A + A) + 0(n 2 (3/4) n/4 ) 

i<n/2, 3n/2<j 

^ P(z A + A) ) ■ I £ v(j#A + A)\+ 0(n 2 (3/4)"/ 4 ) 

K i<n/2 J \3n/2<j<2n-2 J 

= (5 + 0((3/4) n/4 )) • (5 + 0((3/4)" /4 )) + 0(n 2 (3/4) n/4 ) 

= 25 + 0(n 2 (3/4) n/4 ), (3.9) 
where we use (13.11) and (13.51) to get the second to last equality. Combining (13.61) and (13.91) , we have 

p (« and j &A + A) 

0<i<j<2n-2 

= Y p ( z and j ^ A + A) + ^ p ^ and 3 & A + A) + ^ and j &A + A) 

0<i<j<n n<i<j<2n—2 i<n, n<j 

= 2 p (* a ndj £ S + S) + 25 + 0(n 2 (3/4) n/4 ), (3.10) 

0<i<j<n~l 

and so by (T3~4l) 

Var(M n (A)) = 4 ^ P(« and j ^ S + S) - 40 + 0(n 2 (3/4) n/4 ). (3.11) 

0<i<j<n— 1 

Therefore to find the variance, we just need to study F(i and j <^ A + A) for i < j < n. 

Since the other cases are handled similarly, we only present the details for the case when i and 
j are both odd. By Lemma [2TT1 we just need to study the condition graph for P(i and j £ A + A). 
Recall that we already found the condition graph for P(3 and 7 ^ A + A) in Figure [3l After 
untangling this graph, we see that it really consists of two components, as seen in Figure HI 

7 3 4 6 1 2 5 



Figure 4. Untangled condition graph for P(3 and 7 ^ A + A). 
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Also note that each component is a segment graph, a graph that consists of a sequence of vertices 
such that each vertex is connected only to the vertices to its immediate left and right. A similar 
situation holds in general, as seen by the following proposition. 

Proposition 3.1. The condition graph for P(i and j (jL A + A) has components that are segment 
graphs. 

Proof. The condition graph for P(z and j ^ A + A) has vertices with degree less than or equal to 
2; if the vertex is labeled with I, it can only be connected to vertices labeled i — £ or j — £ (if such 
vertices exist). 

Furthermore, there are no cycles in the condition graph. Suppose there is a cycle in the condition 
graph. Consider the vertex in the cycle with the maximum label i and consider the vertices around 
this vertex. Each of these vertices must have exactly two neighbors and so we have the following 
situation as seen in Figure [51 



Figure 5. Vertices around a labeled vertex I. 

Notice that £ + j — i > £ since j > i. Therefore, £ is not the maximum label, which is a 
contradiction and proves that we cannot have a cycle. Thus all components are trees with all 
vertices having degree less than or equal to 2, implying that all are segment graphs. □ 

Since labels in different components are distinct and there are no edges between different com- 
ponents, each component is independent. That is, the probability of getting a vertex cover for the 
entire graph is the product of the probability of getting vertex covers for each component. In this 
way, we just need to find the probability of getting a vertex cover for each component. To do this, 
we find the number of vertex covers for an arbitrary segment graph, which we do in the following 
proposition. 

Proposition 3.2. The number of vertex covers g{n) for a segment graph with n vertices satisfies 
g (n) = F n+ 2, where Fk is the k th Fibonacci number. 

Proof. There are two cases: the first vertex of the segment graph is in the vertex cover, or it is not. 
If the first vertex is in the cover, then the first edge already has one of its vertices picked. Therefore 
we just need a vertex cover for the subgraph with n — 1 vertices that follows the first edge, and 
by definition there are g(n — 1) such covers. If the first vertex is not in the cover, then the second 
vertex must be the cover since the first edge must have one of its vertices chosen. Since the second 
vertex is now in the cover, then the second edge automatically has one of its vertices in the cover. 
Therefore we just need a vertex cover for the subgraph with n — 2 vertices that follows the second 
edge, and by definition there are g(n — 2) such vertex covers. Therefore, we have the Fibonacci 
recursive relationship g(n) = g(n — 1) + g(n — 2). As g(2) = 3 = F A and g(3) = 5 = F 5 , these 
initial conditions and the recurrence imply g{n) = F n+2 , completing the proof. □ 

Therefore, we have 

p 

P(chose a vertex cover for a segment graph with n vertices) = — . (3.12) 

2 n 

n 



Returning to our example with 3 and 7, we note that since the condition graph in this case 
consists of two segment graph components each of length 4, we have 

P(3 and 7 $M + A) = ^ • ^ = i, (3.13) 

where we can multiply the probabilities by the independence of the components. 

In general, as the condition graph may have many components we must find how many segment 
graph components there are in the entire graph for F(i and j ^ A + A) . 

Proposition 3.3. There are (j — i)/2 segment graph components for the graph ofF(i and j ^ 
A- A). 

Proof. Note that in total j + 1 vertices are used in graph of F(i and j ^ A + A); since {i and j ^ 
A + A} depends just on A fl [0, j] , the graph uses exactly the integers in [0, j] . Also note that each 
component must end with a vertex labeled by an integer greater than i. If a component ends with a 
vertex labeled by £ < i, then it can be connected to two other vertices i — £ and j — £. Remember 
that we are assuming i and j are odd (the other cases are similar). As they are odd, i — £ ^ £ and 
j — £ 7^ £ and so i — £, j — £, £ are all distinct. Since £ is connected to two other vertices, it cannot 
be an end vertex. Therefore, each end vertex is labeled by some integer in [i + 1, j). Also note that 
each of these integers must be end vertex since it cannot be used to add up to i. Therefore, the set 
[i + 1, j] coincides with the set of end vertices and since each component has two end vertices with 
distinct labels, there are (j — i)/2 components. □ 

We also need to find the length of each component. Fortunately, there are only two possible 
component lengths for the graph of F(i and j A + A), as seen by the following lemma. 

Proposition 3.4. The length of each segment graph components for the graph of F(i and j 
A + A) is always either 



2 



"z + 1" 








or 2 




j-i 







Proof. First note that the difference between a given vertex and another vertex that is two edges 
away is j — i. This is because the sum of the vertices that share an edge alternates between i and 
j, so that we have segments of the form given in Figure [6] The difference between j — x and i — x 
is j — i as needed. 



x 



FIGURE 6. Difference between every other vertex. 



Now note that these differences can be used to determine the size of each component. Suppose 
the end vertex of a segment graph is m. Since we decrease by j ' — i for every two vertices and since 
we only use non-negative integers, there can only be 



rn 



+ 1 



m + 1 

3 ~ i 



(3.15) 
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decreases. Since we decrease once for every two vertices, we have that the length is twice the 
number of decreases. Therefore the length is 

m+1 



3 - * 



(3.16) 



From Proposition 13.31 we also know that the end vertex m of each segment graph satisfies 
i < m < j. Therefore, the length of each segment graph is always 



or 2 



i + 1 



+ 2, 



as desired. 



(3.17) 

□ 



For simplicity, we denote the first of the two values in (13.141) by q and the second by q + 2. We 
must find the number of components with size q and q + 2. Suppose there are r components of size 
q and r' components of size q + 2. Then conditions on the number of components from Proposition 
I3.3l and the length of each component from Proposition 13.41 gives us the following two equations: 

qr + (q + 2)r' = j + 1 



r + r 



Solving these equations for r, r' in terms of q gives 



(3.18) 



J -* 



q-{i + l) 



2U + 1 



-Q 



1 



2 I 3 + 1 - U 



i + 1 



Therefore, again by independence of components, we have for odd i,j that 

1 



(i and j g A + A) 



(3.19) 



(3.20) 



2j+l <?+ 2 9+ 4 

with q, r, r' as given in (13.171) and (I3.19I ). Arguing similarly leads to formulas for the other three 
cases, which we state below. 



Proposition 3.5. Consider i,j such that i < j. 
Fori,j both odd: 



where 



(i and j A + A) 

"i + 1" 
j-i 



rpr rpr' 

23+1 q+2 r q+4 



(3.21) 



- 2 [{j-i) 



i + 1 



2 ( 3 + 1 - U ~ i) 



-(i + 1) 
j-i 



For i even, j odd: 



(i and j £ A + A) 
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Tjt rpr rpr 

23+1 ° 3+2 ^9+4 



(3.22) 



(3.23) 



where 



i/2 + 1 

j-i 
'i + l 



- 1 



J -i 



r = 



r = 



- ( a - * - 1) 



'i + l' 



J-i 
i + l 



(i + l) + o 



J -i 



For i odd, j even: 



where 



(i andj g A + A) 
+ 1 



TP jpr rpr 



' i + l ' 
j-i 



- 2 



i + l 



3 -i 
i + l 



[i + l) + o 



J -i 



Fori,j both even: 



where 



F(iandj?A + A) = —F^F^F^ 
'i/2 + 1 



o = 



J -i 
J/2 + 1 

j-i 
i + l 



- 1 

- 2 



J -i 



r = 



r = 



2 (0'-i-2) 



i + l 
j-i 



-[j-l-(j-i-2) 



- (i + l) + o + o' 
i + l 



3 -i 



o — o 



(3.24) 



(3.25) 



(3.26) 



(3.27) 



(3.28) 



We conclude this section with some bounds on ¥(i and j £ A + A). We have (Binet's formula) 

1 



V5 



(3.29) 
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where <p = (1 + \/5)/2 is the golden ratio. Therefore, for even n we have 



F n < (3.30) 



Since q + 2 and g + 4 are always even, then for any i, j both odd, we have 



< 



l U q+ *Y fd) q+A Y' 



1 0(gr+(g+2)r')+(2r+2r') 
2i+l 5(r+r')/2 

1 

02J+1 gi/4 

2i+ I 5i7 4 ' 



(3.31) 



where the second to last equality comes from (|3.18l) . In fact, we can use Proposition 13.51 to show 
that (|3.31l) holds for all i, j (slightly better constants hold for the other i, j). 

Ifi = k and j = k + m, where m is fixed and k goes to infinity, a lower bound similar to (13.311) 
also holds. First note that for even n 

1 xnr (l - 4>- 2n y 



5 r / 2 

= -L0-(l-r(l-crV- 2n ) (3.32) 
for some c such that < c < 1/ <p 2n by Taylor expansion. Therefore for odd i, j, we have 

F(iwdj?A + A) = ^L F r q+2 F^ 

1 1 ./..„,_. 1 , ,w . . 

:> 





-2)/2 


02,+l 


5 */4 


2i+i5i/4 




02,+l 


5 */4 


25 +1 55'/ 4 


0* 


02 i+ l 


5 */4 



^ v v ; 5 (<h-4)/2^ v y y 

(l_ r 0-2(</+2) )(1 _ r '0-2(9+4)) 

> ; r— (1 - (r + r')<r %+2) ) 

- 2-? +1 5^/ 4 cj) 1 y y ' ' 

A2j+1 ei/4 

> ^ (l-(j-iU- i/(j - j) ), (3.33) 

and similar formulas hold for the other parity cases. If j/i — >■ 1 not too slowly, then the remainder 
term on the right-hand-side of (13.331) goes to 1. For example, if i — h and j = k + m, then we 
have the following corollary by combining (13.311) and (|3.33l) . 
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Corollary 3.6. For any fixed m, 

j ) 2{k+m)+l rk/4, ik+l i2m 

F(k and k + m & A + A) „ . - — — - = -— — - — -, (3.34) 

v ' y- ' / 2( fc + m )+ 1 5( fc + m )/ 4 (h k 2 k+1 2 m 5 m / 4 

as k goes to infinity with k,k + m are both odd. Similar asymptotics hold for general k,k + m. If 
we ignore the constants related to m, we have 

¥(k and k + m £ A + A) = 6((0/2) fc ) (3.35) 

as k goes to infinity with any k,k + m. 

Note that since F(i and j ^ A + A) has exponential decay in i, j as seen in (|3.31l) . then (13.1 II) 
converges as n — > oo; that is 

lim Var (M n (A)) = 4 V P(i and j g A + A) - 40 (3.36) 

n— voo ' J 

i<j 

exists and is finite. In particular, we know that the limit is an infinite sum of Fibonacci products. 
However, we could not find a closed form for this sum. Nonetheless, because of the exponential 
decay in the terms in the sum, we can approximate the variance well. In particular, note that the 
tail of the sum has exponential decay: 



£ P(i and + A) < Ej(^)'( 



5 i/4 



< 
< 




< 87 £ < 87(0.81) n . (3.37) 



Here we use that (13.311 ) holds for all i, j. Using Mathematica to sum the first 300 terms of (13.361) , 
whose exact form is given in Proposition ^ .51 we get the following approximation for the variance: 

lim Var(M n (A)) = 35.9658 + S, (3.38) 

n— >oo 

where \S\ < 10~ 4 . The error term £ comes mostly from truncating the computation of the 300- 
term series given by Mathematica. By (13.371 ), the error term from truncating the series at n = 300 
is less than 87(0. 81) 300 ~ 3 • 10~ 28 , which is much less than the Mathematica error term. This 
proves Theorem 1 1.41 

4. Exponential Bounds 
We now prove Theorem [Tj] and find exponential bounds for the distribution of M n (A). 

Proof of Theorem [7771 For the lower bound, we construct many A such that A + A is missing k 
elements. First suppose that k is even. Let the first k/2 non-negative integers not be in A. Then 
let the rest of the elements of A be any subset A' that fills in (so A' + A' has no missing elements 
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between its largest and smallest elements); that is M„^ k / 2 (A') = 0. By [MO, Proposition 8], we 
can show that 

F(M n (A') = 0) > 1/2 10 (4.1) 

independent of n. If L C [0, £ — 1] and [7 C [n — it, n — 1] are fixed, then their proposition says 
that 

P([2*-l,2n-2u-l] C A'+A' | A'n[0,£-1] = L,A'n[n-u,n-l] = U) > l-6(2- |L| +2- |c/| ), 

(4.2) 

independent of n. Therefore, 

F{[2£ - 1, 2n - 2u - 1] C A' + A' and A' n [0, £ - 1] = L, A' n [n - m, n - 1] = C/) 

> (l-6(2- |L| + 2- |l/| ))2- £ 2- u . (4.3) 

By letting L = [0,£ — l],U = [n — u, n — 1] so the ends fill in, we get that 

¥(A' + A' = [0, 2n — 2]) > (1 - 6(2"' + 2- u ))2^2 _u . (4.4) 

Letting £ = it = 4 so that the first term in the product is positive, we get that 

F(A' + A'= [0,2n-2]) > (1 - 6(2~ 4 + 2- 4 ))2~ 4 2" 4 = 1/2 10 , (4.5) 

independent of n, which gives us (14.1b . 

As A = k/2 + A', we have A + A = k + A' + A' = [k,2n - 2] and so M n (A) = k as seen by 
Figure [7J 



A: 



k/2 



any set that fills in 



A+A: 







completely filled in 



Figure 7. A and A + A for lower bound. 



Therefore we have 

P(M n (A) = k) > F(A = k/2 + A' and M n _ fc/2 (A') = 0) 

fc/2 



^-J P(M n _, /2 (A') = 0) 



> 



fc/2 



(0.70) fc , 



(4.6) 



where the implied constants are independent of n by (14.1b . This proves the lower bound in Theorem 
ll.ll when k is even. 

If k is odd, then we can let L = [0, £ — 1] \ {2, 3} and U = [n — u, n — 1] so that only the 
element 3 is missing from A' + A'. Then we get a bound for F(M n (A') = 1) as in (|4.6b . Letting 
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A — (k — l)/2 + A', we get the desired lower bound in Theorem ll.ll for when k is odd. 



For the upper bound, we can use bounds like 

£ A + A) < [ - A ) (4.7) 



from HMOH . Again, first suppose that k is even. Note that if A + A is missing k elements, then one 
of these missing elements must be at least k/2 elements away from the ends of [0, 2n — 2] . That is, 
we have the following situation (see Figure [8]). 

i i i in 

'-T-' T «-r- J 



k/2 | k/2 

missing sum more than k/2 from ends 



Figure 8. Upper bound for F(M n (A) = k) 



Therefore 

/ON fe/4 

F(M(A) = k) < F(A + A missing element k/2 away from edges) < f - J ~ (0.93) fc . (4.8) 

Note that this bound does not use the fact that there may be missing elements on both ends. We 
can combine information about missing elements on both ends to show that 

P(iC [0,n-l] : \A + A\ = 2n-l-k) < k I -J < (0.87) fc , (4.9) 

for k < n/2 instead of the previous (3/4) fc / 4 . 

We first introduce some new random variables to keep track of the missing elements. Let L(A) = 
max{m < n — 1 : m — 1 ^ A + A} denote the index of the last missing element of A + A in 
[0, n — Similarly, let R(A) = max{m < n — 2 : 2n — 1— m ^ A + A] denote the index of the 
last missing element of A + A in [n, 2n — 2} counted from right to left. Then 

F(M n (A) = k) = F(M n (A) = k and L(A) < n/2 and R(A) < n/2) 

+ F(M n (A) = k and n/2 < L(A) or n/2 < R(A)) 
= F(M n (A) = k and L(A) < n/2 and R(A) < n/2) + 0((3/4) n/4 ), (4.10) 

where we use F(k g A + A) = 6((3/4) fc/2 ) to get the last equality. Also, note that if M n (A) = k, 
then L(A) + R(A) > k since the total number of missing sums on the left and right fringes must 
be at least k. Then we have 

F(M n (A) = k and L(A) < n/2 and R(A) < n/2) 
< F(L(A) <n/2 and R(A) <n/2 and L(A) + R(A) > k) 

= ^F(L(A) =i <n/2;mAR(A) = j <n/2). (4.11) 

i+j>k 
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Finally, note that L(A),R(A) are independent if they are both less than n/2. As in §@1 this is 
because i A + A depends only on [0,z] D A and j ^ A + A for j > n depends only on 
\j — n + 1, n — 1] fl A. Thus we have 

¥(M n (A) = k and L(A) < n/2 and R(A) < n/2) 

= H L ( A ) = i < n/2) ■ ¥(R(A) =j< n/2) 

i+j>k 



« E 



3 \»/2 /g\ j/2 



47 V4 



4 

m>k 

Combining (|4.10l) and (14.121) . we get that 



i+j>k 

J>(2) <*(2) . (4.12) 



g\ m/2 /^\k/2 



k/2 /g^ n/4 



P(Af n (A) = A:) « fc(-) +( 7 ) . (4.13) 



Therefore if A; < n/2, then we get the desired bound 

/ 3 \ fc/2 

P(M ft (A) = A;) < fcf-J < (0.87) fc . (4.14) 

Note that the bound in (14.141) for the distribution is exactly the same as the bound in (14.71) for a 
single element. Therefore, we can always use this approach to transform bounds on the probability 
of missing elements in (A + A) fl [0, n — 1] to equally good bounds on number of missing elements 
in all of A + A. So it is sufficient to just develop bounds on missing elements on one side of A + A. 
In particular, we can use this approach to transform the bounds in Corollary 13.61 to improve the 
bounds in (I4.14I ). By Corollary 13.61 we have 

F(A + A misses k elements in [0, n — 1]) < ¥(A + A misses 2 elements greater than k — 3) 

Y ¥(i and j g A + A) 

k-3<i<j 

02i+l 5 i/4 



« E 



k-3<i<j 



« .L.u^ = ( ^ ] ~ 0.81 fc . (4.15) 



(,2fe+l 
2^+15^/4" 

Then using the previous approach and introducing L(A), R(A), we get a similar bound on the total 
number of missing sums: 



¥(M n (A) = k) < (Jl <(0.81) fe . (4.16) 

Note that as in (|4.10l) . we always have an extra (3/4) n / 4 term (since this is an upper bound 
depending only on one element). To make this term negligible, we need to have (3/4) n//4 < 
(0.81)\ which means n > k • 41og(0.81)/log(3/4) ~ 2.92A; or that k < OMn. This condition 
is sufficient in this case where we have the bound (0/2) k . However in general, we know that we 
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have a lower bound of (l/2) fc / 2 for the distribution. Therefore, to make the (3/4) n / 4 term always 
negligible, we can have (3/4) n / 4 < (l/2) fc / 2 , which means n > k • 21og(l/2)/log(3/4) ~ 4.82A; 
or that k < 0.2n. Note that then the implied constants are independent of n. Combining (14.61) and 
(14.161) , we get Theorem O □ 

5. Approximating P(k + ai, k + a 2 , . . . , and k + a m ) 

We now prove Theorem 1 1 .51 which says that for any fixed a 1; . . . , a m , there exists \ ai ,...,a m such 
that 

P(k + ai, k + a 2 , ■ ■ ■ , and k + a m A + A) = 6(A^_ a J, (5.1) 
where the implied constants depend on a\, . . . , a m but not k. Therefore, the probability is approx- 
imately exponential. 

To prove this theorem, we use a version of Fekete's Lemma, which says that sub-additive se- 
quences are approximately linear. From [SJ we have the following version in which the sequence 
is both sub-additive and super-additive. 

Lemma 5.1. If a n is a sequence such that 

a n + a m - 1 < a n+m < a n + a m + 1 (5.2) 
for all n, m, then A = inf a n j n exists and for all n, 



— - A 

n 



< -. (5.3) 

n 



Remark 5.2. The proof of this Lemma can be easily modified to get that if 

a n + a m -c < a n+m < a n + a m + c (5.4) 

for some constant c, then 



^-A 
n 



< -. (5.5) 

n 



Suppose that a n is approximately multiplicative rather than approximately additive so that 

C 1 ■ CL m a n < (Xm+n — C • & m Q"n (5-6) 

for all m, n. As log a n satisfies the properties of Lemma I5T1 we have 



log a n _ . 



< - (5.7) 

n 



n 

for all n. That is, 

e" c A" < a n < e c X n (5.8) 

for all n, implying 

a n = 9(A"). (5.9) 

Therefore we just need to relate P(k + a±, k + a 2 , • • • , and k + a m £ A + A) as a function of k to 
some approximately multiplicative function satisfying (15.61) . 

For example, consider P(18, 19, and 21 ^ A + A), whose condition graph is in Figured Note 
that this graph has a loop from vertex 9 to itself since 9 + 9 = 18. We can symmetrize this graph 
by removing this loop and also removing the edge between vertices 8 and 10 and the edge between 
vertices 9 and 10, resulting in the modified condition graph in Figure [101 

Denote the probability of getting a vertex cover for graphs like the one in Figure I77J1 of length n 
by f(n); so the probability of getting a vertex cover in Figure[10]is /(ll). 
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Figure 9. Condition graph for P(18, 19, 21 £ A + A). 



. K 2. 3 ^ 4. 5. 6 ^ 7. 8 9 10 




Figure 10. Modified condition graph for P(18, 19, 21 g A + A). 



Note that /(ll) is an upper bound for the probability in the original condition graph in Figure [9] 
since we have removed some edges. On the other hand, we have the following lower bound: 

P(18,19, and 21 A + A) 
> P(18, 19, 21 £ A + A and 9, 10, 11, 12 £ A) 

= P(16, 17, 19 £ A + A | 10, 11 £ A)P(9, 10, 11, 12 £ A). (5.10) 

Note that the condition graph for P(18, 19, 21 ^ A + A | 9, 10, 11, 12 £ A) is the original condition 
graph in Figure|9]with all edges incident on vertices 9, 10, 11 or 12 removed, as depicted in Figure 



0. K 2 ^ 3 ^ 4 5 ^ 6 7 8 




Figure 11. Condition graph for P(18, 19, and 21 £ A + A | 9, 10, 11, 12 £ A). 

Note that in Figure [TT] we have removed vertices 9, 10, 11 and 12 completely since there are no 
longer any conditions on them in P(18, 19, and 21 £ A + A | 9, 10, 11, 12 & A). Finally, note that 
the probability of getting a vertex cover in the graph in Figure[TT|is just /(9). Therefore, by (15.101) . 
we have 

(l/2) 4 /(9) < P(18, 19, and 21 £ A + A) < /(ll), (5.11) 
where we use that P(9, 10, 11, 12 £ A) = (1/2) 4 . 
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Since the condition graph for F(k, k + 1, and k + 3 ^ A + A) is just a longer version of the 
condition graph for P(18, 19, and 21 A + A), we can apply the same method as before to get 
that 

(l/2) 4 /(A:/2) < P(Jfe,ife + l, and A; + 3 £ A + A) < f((k + A)/2) (5.12) 

for even k, with a similar formula holding for odd k. Therefore we are reduced to studying f(n), 
which is easier to investigate since the condition graph is more symmetric. We will show that f(n) 
satisfies (15.61) , implying it is approximately exponential. 

For example, to see that /(ll) < /(4)/(7), we can separate the graph in Figure [10] at the 4 th 
vertex and remove edges that cross this gap, resulting in the graph in Figure [T2l 



0.1 2 3 4.5.6.7.8 9 10 




Figure 12. Upper Bound for /(ll). 

Since the components are independent smaller copies of the original, the probability of getting 
a vertex cover for the graph in Figure [121 is /(4)/(7). We can do this for any integer less than 
11, defining f{n) for small integers by truncating at the n th vertex. Since we have removed some 
edges to get the graph in Figure [121 we have 

/(ll) < /(4)/(7) (5.13) 

as desired. 

To get a lower bound for /(ll), we use that 

/(ll) > /(ll | 4,5,6,15,16, 17)P(4,5,6,15, 16, 17chosen), (5.14) 

where /(ll | 4, 5, 6, 15, 16, 17) denotes the probability of getting a vertex cover for the graph in 
Figure[T2lgiven that the vertices 4. 5. 6. 15, 16, 17 are chosen. The graph for /( 11 | 4,5,6, 15, 16, 17) 
is depicted in Figure [131 The probability of getting vertex covers for the two independent compo- 



0.1 2 3 4 5 6 7.8 9 10 




Figure 13. Lower Bound for /(ll). 

nents is /(4)/(4). Therefore from (|5.14l) . we get that 

/(10) > (l/2) 6 /(4)/(4) > (l/2) 6 /(4)/(6), 
with the last inequality since /(n) is decreasing. Therefore, in general we have 

(l/2) 6 /(m)/(n) < f(m + n) < f(m)f(n), 
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(5.15) 
(5.16) 



and so f(n) satisfies the conditions of (15.61) . By the modified version of Fekete's Lemma, we have 

f(n) = 9(A n ) (5.17) 
for some A. Therefore by (15. 12b , we have 

¥(k, k + 1, and k + 3 g A + A) = 6(A fe/2 ), (5.18) 

which proves Theorem 1 1.5 1 for the case a\ = 0, a 2 = 1, a 3 = 3. 

The general situation follows in exactly the same way: by first making the configuration graph 
of P(fc + ai, . . . , and k + a m £ A + A) look more symmetric and then using the modified Fekete's 
Lemma. 



6. Consecutive Missing Sums 
Now we prove Theorem [L6l which says that 



(k+m)/2 



< ¥(k + 1, ■ ■ ■ , and k + m A + A) < (- 



(k+m)/2 



:i + er 



(6.1) 



The lower bound comes from the construction in Figure [7] by letting the first (k + m)/2 elements 
of A be missing, which forces the first k + m elements of A + A to be missing as well. That is, 

P(0, 1, • • • , k + m - 1, and k + m £ A + A) = (i/2)( fc+m )/ 2+1 . (6.2) 

Therefore, we only need to prove the upper bound. 

Before giving the proof, we consider an example with condition graphs which illustrates the 
idea. Consider P(16, 17, 18, 19, 20 ^ A + A). The condition graph here is given in Figure [141 




Figure 14. Condition graph for P(16, 17, 18, 19, 20 £ A + A). 

We need to find the probability of getting a vertex cover for this graph. If we remove some 
edges, the probability of getting a vertex cover for the resulting graph is an upper bound for the 
probability of getting a vertex cover for the original graph. We can remove some edges to get the 
graph of Figure [TBI 






Figure 15. Graph after removing some edges. 
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The resulting graph has 3 ~ 21/6 components that are all complete bipartite graphs with 6 
vertices. These are easier to handle since the only way to get a vertex cover for such graphs is to 
have all vertices on one side be chosen. So the probability of getting a vertex cover for one of these 
complete bipartite components is less than (1/2) 3 + (1/2) 3 = 2/2 3 . Since the components are also 
independent, we have 



2 \ 3 / 1 \ 20/6 
PfKi.17. ls.19.20) < (- ~ I- . (6.3) 



and in general we get that 



(fc+4)/6 / i/ 3 x(fc+4)/2 



F(k,k + l,k + 2,k + 3,k + A) < f-J = - (6.4) 

We use this approach in the general proof. Notice that as m — > oo, the size of the complete bipar- 
tite graphs grows, and so we will be taking out relatively fewer and fewer constraints. Therefore, 
this approach gets us closer to the correct answer. 

Now we give a formal proof of Theorem 1 1.61 that does not rely on the condition graphs. 

Proof. We first do the proof for P( k, k + 1, ... , andA; + 2m — 1 ^ A + A) with 2m — 1 instead of m. 
Note that since the probability depends only on [0, k+2m] DA, we can assume that AC [0, k+2m] . 
We also assume that m divides k and that 

k = qm (6.5) 

with q even. 

We can write A as the following disjoint union: 

A = A U Ay U ■ ■ • U A q U A q+ y, (6.6) 

where 

Aj = A n [jm, (j + l)m - 1). (6.7) 
Then if [k, k + 2m - 1] n {A + A) = 0, then [k, k + 2m-l]n (Aj + A q _j) = for all j. Note that 

Aj + Aq-j C [k, k + 2m -2}. (6.8) 

Therefore, [k, k + 2m — 1] n (Aj + A q ^j) = implies Aj + A q -j — 0. If q is even, we have 

P(fc, k + 1, • • • , and k + 2m - 1 £ A + A) < P([jfe, k + 2m] n (Aj + Aq-j) = for all j < q/2) 

= F(Aj + A q - j = for all j < q/2) 
= F(Aj = or A q -j = for all j < q/2). (6.9) 

For different j, the pairs of sets Aj, A q _j are disjoint. Therefore, we have independence: 

3/2-1 

F(Aj = or A q -j = for all j < q/2) = F(A q/2 = 0) ] [ F(A ] = or A q -j = 0). (6.10) 

3=0 



Finally, note that 



P(A,- = 0orV; = 0) < P(A i = 0)+P(Vi = 0) = ^- C 6 - 11 ) 
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Combining dOi (ISTTOl) . and (I67TTT) . we find 



I 9/2-1 2 

P(M + 1,.-., andfc + 2m-l^A + A) < — JJ _ 



3=0 

9/2+1 

2 m 



I x (fc+2m)/2 



2 fc/2m - . (6.12) 



This inequality is true for all m, such that g = fc/m is an even integer. 
Changing m to m/ 2, we get that 



/ , \ (k+m)/2 

k + 1, . . . , and k + m - 1 £ A + A) < 2 fc/m f - J (6.13) 

for even m and q = k/m still an even integer. Note that (|6.13l) is similar to the bound we get in 
(16.41) using the condition graph approach. 

For odd m, we just need to use (16.131) . noting that 

P(fc, fc+1, . . . , k+m-1, and k+m £ A+A) < P(fc, jfe+1, . . . , and jfe+m-1 £ A+A). (6.14) 

For odd q, we need to partition A such that there is a block in the very middle of A. This ensures 
that this middle block is matched with itself (just like A q / 2 was matched with itself when q was 
even). This gives us the extra l/2 m that is needed in order to achieve the bound. For non-integer 
q, we need to repartition A in a similar way. Therefore the bound in (16.131) holds in general, up to 
a constant. 

Finally, note that as i — > oo, we have 2 1 l m — y 1. Writing 2 1 /" 1 = 1 + e m , we have 

/ 1 n (fc+m)/2 

P(fc, . . . , and k + m — l^A + A) < (-) (1 + e m ) fc , (6.15) 



2 y 

where e m — )■ as m — V oo. By raising 2 1//m = 1 + e m to the m th power, we see that 

e m < — . (6.16) 

m 



Therefore a weakened version of the inequality says that 

-m)/2 /-^n (fc+m)/2 

< P(/c + !,••-, and k + m ^ A + A) < (-) (l + e m ) fc , (6.17) 



2^ 

where the implied constants are independent of m, k. □ 

This bound is interesting since it means that the trivial lower bound is almost the right answer 
for the exact bound. The trivial lower bound makes us miss all of [0, k + m\ in A + A as seen in 
(16.21) but we only need [k + 1, k + m] to be missing. In this sense, we see that essentially the only 
way to miss m consecutive elements at k + 1 for large m is to miss all the previous elements as 
well. 
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7. Bounds on m(k), w{k), y(k), and z{k) for k < 32 

As mentioned in §11 .3l and covered in more detail in §17.21 it suffices to bound z{k). Our strategy 
is this: if D + D (where D is a uniformly chosen subset of N that contains 0) is missing exactly 7 
elements, then it is very likely that those 7 missing sums are all smaller than 90 and typically even 
all smaller than 45. If we loop overall 2 44 possibilities (3 for DD [0, 45), for each possibility we can 
compute (D+D) n [0,45) = (/3+/3)n[0, 45) and a subset of (D+D)n[45, 90) D (/3+/3)n[45, 90). 
From this (with a little theory to handle the tail of the sumset) we can bound the likelihood of 
missing exactly 7 sums, given D R [0, 45). By combining these estimates, we acquire bounds on 
z(7). " 

Let n > 2 be a natural number parameter (the computations reported here use 
n = 32, and we will replace with n = 45 sometime in June), and set 

z(Jfe|0) := P(|N\ (D + D)\ = k | Dn [0,n) = . (7.1) 

We have 

z(jfe) = ^ z(A; | /3)P (D n [0, n) = 0) = 2~ (n ~ 1) «(A;|/3), (V.2) 

06,SC[0,n) 0e/3C[0,n) 

so that it suffices to bound z(k \ (3) above and below for all < k < 32 (our arbitrary notion of 
"small k" is < k < 32) and all e P C [0, n). 
Further, set 



B 


= Dn [0,n) 


V 


= [0,n)\(f3 + f3) 


C 


= [n,2n)\(P + P) 


m 


= min£ 


T 


= [2n, oo) 


V 


= E[\[n,oo)\(D + D) \ B = (3] 


n 


_ 2-|/9n[0,m-n]| 



If we condition on B = (3, then the elements of V are Definitely missing from D + D, the 
elements of C are £ikely but not certain to be missing, and the elements of T, the Tail of the 
natural numbers, are very likely to be missing. Note that In — 1 6 C, so C is nonempty and m is 
well-defined. 

Lemma 7.1. For all k < \V\, we have z(k \ /3) = 0. 

Proo/ Conditioning on £ = (3, we have DCN\(D + D).In fact, V = [0, n) \ (D + D) 
Lemma 7.2. We have 

n = 5 .2-\ t3 \ + J22- lBn[0/ - n]l . 

Proof. By linearity of expectation 

77 := E[|[n,oo)\(D + D)|] = E [| [n, 2n) \ (D + D)\] +E[\T\(D + D)\] . 
Again using linearity of expectation, we have 

E[|[n,27i)\(D + D)|] = J2 F ^^ D + L> y 
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□ 

(7.4) 

(7.5) 
(7.6) 



Since £ D + D is the same as (for n < £ < 2n) 

1/2 



£D + D = /\(i£DV£-i£D) 



i=0 



f\ i-b^D. (7.7) 



b<e-n 



Thus 

F{£^D + D) = 2-l /3n ^- ri ]l, (7.8) 

and so 

£> + £>) = ^ 2 - |/3n[0 ' £ - n]l . (7.9) 
tec eec 

The proof that 

E[|T\ (D + D)\] = 5-2-^ (7.10) 

is essentially in [MO], but we sketch it here for the reader's convenience. Using linearity of expec- 
tation, 

oo 

E[\T\(D + D)\] = J2F(t#D + D) (7.11) 

t=2n 

and 



P (t # D + D) = w{([\t-b£D\[\{[\i(£DVt-i(£D 

\\bel3 ) \i=n t 



(7.12) 



Now this has two cases leading to 



l2- |/3| f3/4) ( *- 2n+1)/2 t is odd 

= {^.?g (3/ 4)<->/» <7 - i3 > 

The infinite sum (17.1 II) now simplifies 5- 2~ 1/31 . □ 
Lemma 7.3. We have max{0, 1 — t]} < z(\V\ | /?) < 1 — /i. 
Proof. Trivially z(\V \ \ fi) > 0. Since 

7? = E[|[n,oo)\(D + D)| | B = /3] 

oo 

= ^Tz(\v\+i\p) ■ i 

i=Q 
oo 

> £*(|Z>|+z|/3) 
1=1 

= (7.14) 

we also have z(\V\ | /3) > 1 — 77. 

Observe that the event |N \ (D + D)| > |X?| contains the event {m ^ D + D}, and so 

P (|N \ (£> + £>) I = |X> I ) < l-P(m^D + D) = 1 - fi, (7.15) 

concluding the proof of this lemma. □ 

27 



Lemma 7.4. max{0, 2/i - rj} < z(\V\ + 1 | f}) < min{l, rj}. 
Proof. Trivially < z(\V\ + 1 | (3) < 1. We have 



7] = k ■ z(\V\ + k | (3) > z(\V\ + 1 | (3), (7.16) 



k=0 



which leaves only the bound 2/i — r) < z(\V\ + 1 \ (3) to prove. 

The idea here is that if exactly \T>\ + 1 sums are missing, they are very likely to be the \V\ 
elements of V, and m. Formally, 

{|N\(£> + £>)| = |£>| + 1} 2 {m£D + D}r\ f] {£ e D + D} n f]{t e D + D} 

tec teT 

e>m 

( \ 



D {m^D + D}\ 



\J{l£D + D}u\J{t£D + D} 

\ eec teT 

\£>rn / 



(7.17) 

and so 

z(\V\ + l\/3) > ¥{m^D + D)- s ^¥{(,^D + D)- s ^¥{t^D + D) 

tec, teT 

i>m 

= 2P (m D + D) - p D + D) 

ieCuT 

= 2/i -77. (7.18) 

□ 

Lemma 7.5. For k > 2, < z(\V\ + k \ (3) < \ mm{r/, 2r] - 2/i}. 

k 

We note that sometimes this bound is weaker than z(\T>\ + k \ f3) < 1. This happens for few 
enough f3 that, from a computational vantage point, it is not worth checking for. 

Proof. Trivially, < z(\V\ + k \ (3). We have 

00 

v = J^i ■ Z (\V\ +i) > kz(\V\ + k), (7.19) 

i=0 

whence z(\T>\ + k) < rj/k. We also have 



i=0 



= z{\V\ + + ■ z(\V\+i) 



i=2 



> 2fi-i] + kz(\V\ + k), (7.20) 
and so z(\V\ + k) > (2i] — 2fi)/k. □ 
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7.1. Making the computation feasible, reliable, and verifiable. A massive computation is in 
order (we hope to iterate over 2 50 or so sets), and so some words are necessary as to how this is 
feasible. Set 

fmax{0,2 n - 2 n r/}, k = \V\ 

Lower (k | P) := i max{0, 2 • 2> - 2 n r/}, k = \V\ + 1 
I 0, otherwise 

'2 n -2>, k = \V\ 

UPPER (A; | (3) := { min{2 n , 2 n r/}, fc = |D| + 1 
0, otherwise 



Overhang (k \ (3) 



min{2 n r7, 2 • 2 n r] - 2 • 2», = |2?| 
0, otherwise. 



The lemmas above imply that that the vector 

2 2n - 1 (z(0), z(l), . . . «(31)> = J2 2 "^° I I /?)> • • • , *(31 I /?)} (7.21) 

0G/3C[0,n) 

is bounded below componentwise by 

J2 (Lower(0 I /3),Lower(1 I Lower (31 | /?)) (7.22) 

0S/3C[0,n) 

and is bounded above componentwise by 



^(Upper(0 I f3), Upper(1 \ (3),..., Upper(31 | f3)) + 



(Overhang(0 I (3), Overhang(1 \ (3),..., Overhang(31 | f$)) ■ M 



where M is the 32 x 32 matrix whose (i, j)th entry (running the indices from to 31) is if 
j > i + 2, and is otherwise. This allows us to compute an upper bound on z(0), . . . , z(31) from 

Y (Upper(0 I p), Upper(1 I /?),..., Upper(31 | p)) (7.23) 

06/3C[0,n) 

and 

Y (Overhang(0 I P), Overhang(1 I /?),... , Overhang(31 | p)). (7.24) 

0S/3C[0,n) 

Observe that Lower, Upper and Overhang are always integral, as 2 n /i and 2 n r] are both inte- 
gers; this means that we can compute (17.221) . (17.231) and (17.241) using only integer arithmetic. 

We need to compute P + p and P fl [0, k] (for various k) for each p. This work can be tremen- 
dously reduced by using a Gray code. That is, the subsets of [l,n) can be enumerated in such 
a way that each set differs from its predecessor in only one element (either put in or taken out). 
By storing the representation function for P + P (that is, the number of times each sum can be 
written as a sum of two elements of P), we can simply update the necessary computations instead 
of re-computing. 
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Unfortunately, the size of the computation requires us to use 2n + 1-bit integers, and this is not a 
supported data type in most languages for n > 32. The options of using C with GMP, Mathematica, 
or some other route to arbitrary size integers is prohibited by the size of our computation and the 
modesty of our actual needs (we add, but never multiply, and know a priori the number of bits we 
need) we choose to represent our numbers as arrays of 64-bit integers in C++. To further extend 
our reach, we run the code on the parallel computing cluster at HPCC. To facilitate parallelization, 
we break j3 into ft = (3 fl [0, ni) and ft = (3 fl [ni, n). This makes the algorithm "embarrassingly 
parallel", and allows us to store intermediate calculations both to recover from any system or power 
failings, and to allow for spot checking of results. 

To ensure correctness of the results, we have written the code in Mathematica using the simplest 
algorithms conceivable. Such code becomes intractably slow around n « 25, but this provides a 
sequence of values against which we can test our progressively more subtly written code, both in 
Mathematica and in C++. Our most sophisticated code is in C++. 

Finally, we have the bounds on P (|N \ (D + D)\ = k \ D n [0, 2 16 ) = ft) for all ft in a pub- 
licly available file. We invite the reader to spot check our implementation. 

7.2. Obtaining y(k), m(k), and w(k) from z(k). While it is clear that z(k) is defined, that 
is, the event "|N \ (D + D)\ = k" is measurable, it is less clear that z(oo) = 0. This, and that 
y(oo) = 0, follows from the Borel-Cantelli lemma and bounds such as (11.161) . We can define D (a 
uniformly chosen subset of N containing 0) as C — min C (where C is a uniformly chosen subset 
of N), and so 

oo 

y {k) = ^>(minC = i AND |N\ (C + C)\ = k) 

i=0 
oo 

= ^ P (min C = i AND |N \ ((C - min C) + (C-minC))| = k - 2i) 

i=0 
Lfc/2j 

= ^ P(minC = i)P(|N\ (D + D)| = k - 2i) 

i=0 

Lfe/2J x 

i=0 

To obtain the formulas 

k k 

i(k) = ^r,y(i)y(k-i), w(k) = ^z(i)z(k -i) (7.26) 



m( 

i=0 i=0 



we refer the reader to flU. The gist of the argument is that as P ([n/2, 3n/2] C A + A) — > 1, 
m(k) := P(|[0,2n-2] \ (A + A)\ = k) 

k 

~ 5^P(|[0,n/2) \ (A + A)\ = i AND |(3n/2, 2n - 2] \ A + A\ = k - i) . (7.27) 



i=0 
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Since A + An[0, n/2) is only affected by An [0, n/2) and A + An (3ra/2, 2n - 2] is only affected 
by A fl (n/2, n), we can use independence to write 

fc 

m (jfe) ~ ^F(\[0,n/2)\(A + A)\=i)F(\(3n/2,2n-2]\A + A\=k-i). (7.28) 

i=0 

As n — > oo, the set [0, n/2) \ (A + A) looks more and more like N \ (C + C), so that 

P(|[0,n/2)\(A + A)|=i) -> </(z), (7.29) 

and similarly (after replacing A with n — 1 — A) for P (|(3n/2, 2n — 2] \ A + A| = k — i). The 
argument for w(k) is identical, but with "D" in place "C". 

Let Z\, Z 2 be independent random variables with the same distribution as M N |{ }, and set W : = 

Zi + Z 2 . Then F (W = k) = Yli=o z (^) z (^ ~ *) = whence X^o^W = 1> an( ^ similarly 

Since y(k) is a linear combination of z(0), . . . , z(k) with positive coefficients, the lower bounds 
on z(0), . . . , z(k) immediately give a lower bound on y(k), and likewise upper bounds on z(0), . . . , z(fc) 
yield an upper bound on y(k). The situation is the same between y and m and between z and w, 
even though the combination is not linear! 

To experimentally estimate z(k), we hypothesized that F(N\(D + D) % [0,256)) is suffi- 
ciently small as to be ignored. Then, using Mathematica 8, we generated 2 28 pseudorandom subsets 
E of [0, 256), forced each to contain 0, and then computed k := |[0, 256) \ (E + E)\ and kept a 
running tally of the number of times each value of k arose. This estimates (with an enormous 
sample size) 

P (|N \ (D + D)\ = k | N \ (D + D) C [0, 256)) w «(ife). 

The estimates along with conservative 99.9% confidence intervals, are given in Table [T6l 

and shown in Figure [2l The implied bounds on w, m, and y are given in Tables [T9l [TBI and [171 
respectively, and shown in Figure [21 The tables are collected together in Appendix lAl 

8. Conjectures and Future Research 

We end with some conjectures that are supported by numerical data, and discuss some additional 
projects. Our main conjecture remains Conjecture 1 1.21 which says that the distribution of missing 
sums is approximately exponential. One possible method of studying this distribution is finding 
where the first present sum in A + A occurs, given that A + A has k missing elements. Recall that 
the lower bound in ^4] was proven by constructing A such that M n (A) = k by letting the first k/2 
elements of A be missing. In this case, the index of the first present sum in A + A occurs at index 
k. But from numerical data, the index of the first present element will not be k for typical A + A 
that is missing k elements. This also suggests that this trivial construction does not account for the 
real 'random' way of constructing A such that A + A is missing k elements, which is consistent 
with the fact that conjectured decay constant for the distribution is 0.78 but the lower bound gives 
only the decay constant 0.70. Even though the index of the first present element is not k, from 
numerical data, the index seems to be linear in k. 

To be precise, let X n (A) = min{m : m — 1 G A + A} be the index of the first present sum of 
A + A. Then we have the following conjecture. 

Conjecture 8.1. For large k, 

lim E(X n (A) | M n (A) = k) (8.1) 

n— >oo 
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is approximately linear in k. 

Here we have a limit in n largely to emphasize that the terms do not depend on n but only on k; 
however, it may be possible to prove that for any fixed k, this limit exists. It may also be possible 
to remove the limit by having k < 0.2n. 

Similarly, we can consider the first index by which A + A has all of its k missing elements; that 
is, Y n (A) = min{m : [0, m — 1] D (A + A) has k missing sums}. If Y n (A) = k, then A + A 
is missing exactly the first k elements, which is the same situation as in the trivial lower bound; 
however, for typical A this does not seem to be the case, and we conjecture 

Conjecture 8.2. For large k, 

lim E(Y n (A) | M n (A) = k) (8.2) 

n— >oo 

is approximately linear in k. 

Another direction is to improve the exponential bounds for F(M n (A) = k). One approach to do 
this is to find upper bounds on probabilities like P( ttl, • • • , dm ) for arbitrary ai, ei2, • • • ,a m around 
k. Recall that in §g]we first used P(z ^ A+A) to get an upper bound for P(M„ (A) = k) of (3/4) fc/2 
and then used P(z, j ^ A + A) to get a bound of (0/2) fc , an improvement. Knowing P( 
would result in similar improvement. Using the current approach, this would require studying the 
number of vertex covers for graphs that have vertices with degree m instead of 2. 

Note that we already have an upper bound F(k + 1, • • • , k + m) from §[6] since this is the case of 
consecutive missing sums. This probability tends towards (l/2)( fe+m )/ 2 , which matches the trivial 
lower bound. If ¥(k + a%, k + a 2 , ■ ■ ■ , k + a m ) < ¥(k + 1, • • • ,k + m) for any a%, ■ ■ ■ , a m , then the 
probability for missing k elements would be (l/2)( fe+m )/ 2 . Since this does not seem to be the case 
from numerical data, the probability of missing m elements around k is asymptotically different 
from the probability of missing m consecutive elements around k; that is A aii ... |0m from Theorem 
P is not close to (1/2) 

Note that it is possible to use the graph-theoretic approach to study higher moments of M n . 
Recall that the variance was calculated by finding explicit formulas for P(i and j ^ A + A). 
Similarly, the mth moment can be found by finding explicit formulas for P(ai, ■ • • , and a m ^ A + 
A) for arbitrary which requires finding the number of vertex covers in certain graphs 

that have vertices with degree m. Note that we again need to study P(a 1; • ■ ■ , and a rn A + A), 
as we do when we try to improve the bounds for F(M n (A) = k); however now we need an exact 
formula for P(ax, • ■ ■ , and a m £ A + A), whereas before we just needed an upper bound. 

Finally, the hypergraph arguments of this paper generalize to linear forms such as {£a\ + ma^ : 
ai, a% E A} (with £, m fixed integers). It would be interesting to carry out this generalization, as 
well as considering the case with three (or more) summands of A. 

Appendix A. Data tables for distributions 
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Figure 16. The first and last columns give our rigorous lower and upper bounds 
on 10 5 2;(A;). The second and fourth columns give the bounds of a conservative 

99.9% confidence interval (CI) for 10 5 2;(A;). The middle column gives our best 

guess for the integer closest to 10 5 z(A;), which we denote 10 5 z(k). 
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Figure 17. The first and last columns give our rigorous lower and upper bounds 
on 10 5 y(k). The second and fourth columns give the bounds of a conservative 

99.9% confidence interval (CI) for 10 5 y(k). The middle column gives our best 

guess for the integer closest to 10 5 y(k), which we denote 10 5 y(k). 
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Figure 18. The first and last columns give our rigorous lower and upper bounds 
on 10 5 m(A;). The second and fourth columns give the bounds of a conservative 

99.9% confidence interval (CI) for 10 5 m(/c). The middle column gives our best 

guess for the integer closest to 10 5 m(/c), which we denote 10 5 m(A;). 
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Figure 19. The first and last columns give our rigorous lower and upper bounds 
on 10 5 w(k). The second and fourth columns give the bounds of a conservative 

99.9% confidence interval (CI) for 10 5 u>(/c). The middle column gives our best 

guess for the integer closest to 10 5 w(A;), which we denote 10 5 u>(A;). 
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